Epidemic Spreading with External Agents 

Siddhartha Banerjee, Aditya Gopalan, Abhik Kumar Das, and Sanjay Shakkottai 



Abstract — We study the spread of epidemics in large networks 
when the spread is assisted by a small number of external 
agents: infection sources with bounded spreading power, but 
whose movement is unrestricted vis-a-vis the underlying network 
topology. For networks which are 'spatially constrained', we show 
that the spread of infection can be significantly speeded up even 
by a few such agents infecting randomly. More specifically, for 
general networks with bounded external virulence (e.g., a single 
or finite number of random mobile agents), we derive upper- 
bounds on the order of the infection time as a function of network 
size. Conversely, for certain common classes of networks such as 
line graphs, grids and random geometric graphs, we also derive 
lower bounds on the order of the spreading time over all (po- 
tentially network-state aware and adversarial) external infection- 
spreading policies; these adversarial lower bounds match (up to 
logarithmic factors) the spreading time achieved by an external 
agent with random mobility. This demonstrates that random, 
state-oblivious infection-spreading by an external agent is in fact 
order-wise optimal for dissemination in such spatially constrained 
networks. 

Index Terms — Epidemic processes, infection/information 
spreading, long-range contact, mobility. 

I. Introduction 

Various natural and engineered phenomena around us in- 
volve the spread of information or infection through different 
kinds of networks. Rumours and news stories propagate among 
people linked by various means of communication, diseases 
diffuse as epidemics through populations by various modes, 
plants disperse pollen/seeds and thus genetic traits geographi- 
cally, riots spread across pockets of communities, advertisers 
aim to disseminate information about goods through net- 
works of consumers, and computer viruses, email worms and 
software patches piggyback across computer networks. Un- 
derstanding how infection/information/innovation can travel 
across networks through such epidemic processes has been a 
subject of extensive study in disciplines ranging from epidemi- 
ology (2, 0, HI, sociology 0, |6| and computer science 
EL El, to physics flOl . information theory/networking 
ED, E3, 03, 01, EH and applied mathematics lfl6l. ifTTl. 
[ 18 1, yielding valuable insights into qualitative and quantitative 
aspects of spreading behaviour in networks. 

At a high level, the spread of such infections in networks 
is due to the interaction of two processes-(i) a local spreading 
process in the underlying network, and (ii) a global infection 
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process due to agents that are external to the network. Based on 
this observation, we propose and study a model for spreading 
in networks that decomposes into two distinct components - 
a basic intrinsic spread component in which infection spreads 
locally among neighbouring nodes due to the underlying graph 
topology, and an additional external spread component in 
which 'external agents' (potentially unconstrained by the un- 
derlying graph) can carry infection far from its origin, helping 
it spread globally. More specifically, we develop a rigorous 
framework with which we quantify the effect that a number 
of omniscient (i.e. network-state aware) and adversarial (i.e. 
attempting to maximize the rate of infection) external infection 
agents can have on the time required to spread infection 
throughout the network. 

We stress that the generic terms 'intrinsic spread' and 
'external spread' serve to model a variety of situations in- 
volving heterogeneous modes of spreading. In the context of 
wireless communication, for instance, consider the increas- 
ingly studied propagation |fl9ll , ||20| , ll2Tl . Il22l of viruses 
and worms that exploit the connectivity afforded by both 
modern short-range personal communication technologies like 
Bluetooth and long-range media such as SMS/MMS and 
the Internet. To paraphrase Kleinberg 11231 . outbreaks due to 
such worms are well-modelled by local spreading on a fixed 
network of nodes in space (i.e. short-range Bluetooth wire- 
less transmissions between neighbouring quasi-static users) 
aided by relatively unrestricted paths through the network 
(i.e. long-range, faster-timescale emails and messages through 
SMS/MMS/Internet). Other, more classically-studied, exam- 
ples of multiscale spreading include those of natural disease 
epidemics [2|, [24- 1 and bioterror attacks E5l . where infection 
can spread locally through spatial pathways (i.e. interpersonal 
contact) and through large-scale geographic means (e.g. human 
movement through airline routes) ll26l . In all these and allied 
cases, a form of agency, external to the underlying graph, is 
responsible for long-range proliferation of an otherwise locally 
diffusive contagion, and it is the effect of this external agency 
that we wish to investigate. 

Given the applicability of our epidemic model with external- 
infection sources, a fundamental characterization of the impact 
of external agency on epidemic spread has a twofold utility: 

(a) (Adversarial perspective) Whenever malicious epi- 
demics (such as those described before) threaten to 
spread via both intrinsic and external means, it be- 
comes important to understand the worst-case long- 
range spreading behaviour (this is the component that 
can potentially accelerate the spread) in order to deploy 
appropriate countermeasures. 

(b) ( Optimization perspective) In cases where propagation is 
desirable and the external component can be controlled, 
an adversarial study of external-agent assisted spreading 
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can constructively help design fast mobile spread strate- 
gies (e.g. viral advertising |9|, network protocol design 
[27 1 and diffusion of innovations J5)). 

A. Main Contributions 

We consider large graphs G — (V, E) in which an epidemic 
starts at a designated node and commences spreading through 
two interacting processes: an intrinsic spread which follows 
the standard Susceptible-Infected (SI) dynamics ifTTIl (also 
termed contact process ifTBl , ifTTl ) with Ltd. exponentially 
distributed propagation times for each edge, and an additional 
external infection. To model the external infection process, 
we allow every node in the graph to get infected at a 
potentially different (including zero) exponential rate at each 
instant; thus at time t, the state of the network consists of 
a set of nodes which are infected (therefore determining the 
intrinsic infection process) and a \V\ -dimensional vector L(t) 
of external infection-rates for each node. The virulence of the 
external agents is measured by ||i(£)||i (i.e., the sum rate of 
external infection). We allow L(t) to be chosen as a function 
of the network state and history {omniscience) and further, 
chosen adversarially, i.e., designed to minimize the time taken 
to infect all nodes; in Section [II] we discuss how this model 
generalizes various models for long-range infection spreading. 

In this setting, our main message is somewhat surprising- 
in spite of the 'adversarial power' external agents have for 
spreading infection, we show that for common spatially- 
constrained graphs (i.e., having high diameter/low conduc- 
tance), a simple random strategy is order-optimal. More for- 
mally, our contributions in this paper are as follows: 

(a) We develop general upper bounds on the order of the 
infection time (expectation and concentration results) 
for large graphs when the external infection pattern is 
random, i.e. when every node is susceptible to the same 
external-infection rate, irrespective of the infection-state 
and graph topology. The bounds are based on the graph 
topology (in particular, diameter/conductance of appro- 
priate subgraphs) and a lower bound i m ; n (n) on the 
virulence (which can scale with the network size). We 
also analyze an alternate greedy infection policy based 
on the same graph partitioning scheme, for which we 
obtain better bounds. 

(b) For common classes of structured graphs (ring/line 
graphs, d-dimensional grids) and random graphs (geo- 
metric random graphs) which have high diameter/low 
conductance (spatially-constrained), and given an upper 
bound L max (n) on the virulence, we use first-passage 
percolation theory IfTTl to derive lower bounds on the 
order of infection times (again, both in expectation and 
w.h.p) over all (possibly omniscient and adversarial) 
external-infection policies. These lower bounds match 
the upper bounds for random spreading up to logarithmic 
factors, showing that random external-infection policies 
are order-wise optimal for such spatially-constrained 
graphs. Furthermore, they match exactly for the greedy 
policies, indicating that these bounds are tight. 



(c) Apart from results for specific graphs and policies, the 
general bounds (and related techniques) are of inde- 
pendent interest in that they provide a fairly complete 
picture of the dependence of spreading time on external 
virulence and graph topology in a wide regime; in par- 
ticular, it is tight for graphs with polynomially -bounded 
diameter (i.e., diameter D(n) = il(n a ) for some a > 0) 
and sub-linear external virulence (i.e.,||L(i)|| 1 = o(n)). 
To demonstrate this, we discuss how other spreading 
models (graphs with additional static or dynamic edges, 
and/or mobile agents) can be analyzed in our framework, 
and what our bounds translate to in such cases. 

B. Related Work 

Prior work concerning network spread, though diverse in 
scope and treatment, does not address the impact of adversar- 
ial external-agent-assisted spreading in spatially-constrained 
networks. Moreover, to the best of our knowledge, it lacks 
a consistent analytical framework in which the effects of 
different forms of external agency on spreading time can 
be compared. There has been much work in studying the 
static spread of infection/innovation using various notions 
of influence and susceptibility, both numerically using field 
data/extensive simulations iflOl . [5|, |6], |7|, |28| and analyti- 
cally Hi], O, Oil, HI, E9). For the case of spreading via 
external agents, many numerical studies have investigated the 
spread of infectious diseases with specific mobility patterns, 
e.g. via airline networks |30) . (26), heterogeneous geographic 
means (2), Gi, and recently, electronic pathways |23l . [19|, 
ED) . ED . E2) . Several notable works in communication 
engineering include studies in which all network nodes are 
simultaneously mobile- for designing gossip algorithms lfl5l . 
ETll or improving the capacity of wireless networks [14| -and 
analysis of rumour spreading on fully-connected graphs [12|, 
lfl3ll . Other design-oriented studies include the investigation 
of optimal seeding in networks for maximum spread from 
a computational perspective [9], efficient routing over spatial 
networks with fixed long-range links [8] and tight bounds on 
deterministic spreading with external-agents in <i-dimensional 
hypercubes ||3D . 

II. Model for Epidemics with External Agents 

Consider a sequence of graphs G n — {V n ,E n ) indexed by 
n, with the n-th graph having n nodes (for ease of notation, we 
label the nodes in V from 1 to n). For instance, G n could be 
the ring graph with n nodes, or a (2-dimensional) y/ri x y/n 
grid. For convenience, we will drop the subscript n for all 
quantities pertaining to the graph G n when the context is clear. 

We model the spread of an epidemic on the underlying 
graph G n (or G) using a continuous-time spreading process 
(S(t)) t >o. At each time t, S(t) = (Si(t), . . . , S n (t)) G 
{0, 1} V denotes the 'infection state' of the nodes in V: 
Si(t) — (resp. Si(t) — 1) indicates that node i G V is 
'healthy' (resp. 'infected') at time t. Let us denote by S(t) the 
set of infected nodes at time t, i.e. S(t) = {i G V : Si(t) = 1}, 
and use J\f(S(t)) to denote its size. In order to capture the 
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effect of external agents, the evolution of S(t) is assumed to 
be driven by the following modes of infection spread: 

• Intrinsic infection: Initially, at t = 0, all nodes are 
healthy, except for a single node (node 1, say) that is 
infected. Once any node is infected, it always remains 
infected, and infects each of its neighbouring (w.r.t. G) 
healthy nodes at random times, independent and expo- 
nentially distributed with mean 1. We term this form of 
spread as infection via basic contact. 

• External infection: At time t, in addition to being infected 
by its neighbours in G, each node i is susceptible to an 
external infection (or alternately, a long-range contact) 
with an exponential infection-rate Li(t). The external 
infection-rate vector L(t) = (Li(t))i e v can vary with 
time t and can depend on the state of the network S(t) 

We note here that the dependence of the external infection rate 
Lit) on the network state allows us to model the propagation 
of infection through a wide range of external infection pro- 
cesses transcending the structure of the underlying network 
(G). For instance, 

(a) L(t) — represents infection occurring only through 
edges of the underlying graph (the standard contact 
process). 

(b) Well-known studies by Kleinberg ll32l and Watts- 
Strogatz ll33l show that adding a few fixed long-range 
edges onto structured networks can dramatically reduce 
routing time and diameter. The above long-range contact 
model captures the dynamics of infection spreading with 
L such additional edges, say, by letting Li(t) be the 
number of long-range edges incident on node i that have 
an infected node at the other end at time t. 

(c) Long-range edges over the underlying graph, instead of 
being drawn in a static manner, can be dynamically 
added and deleted as time progresses. For instance, 
infected nodes can "throw out" fresh sets of long-range 
edges at certain times -this corresponds to choosing 
fresh sets of long-range infection targets depending on 
network state or other parameters. 

(d) Moving beyond long-range structures, the external infec- 
tion vector can also be used to model "virtual mobility"; 
the external infection could be caused by one or several 
mobile agents, whose position is unconstrained by the 
graph, and which thus spread infection to various parts 
of the network with corresponding rates L{f). 

(e) At an even more abstract level, the external agent can 
be viewed as an external information source with band- 
width 1 1 L(t) which can share its bandwidth across 
nodes of the graph. Such a model can be used to design 
optimal spreading processes for viral advertising, spread 
of software updates, etc. 

Throughout this paper, we have used the term intrinsic 
infection (likewise external infection) interchangeably with 
basic contact (likewise long-range contact). Now, to complete 
our system description, we term the quantity ||Z(t)||i as the 
long-range virulence at time t, i.e., the power of the infection 
to spread through long-range contact. In this work, we restrict 
ourselves to scenarios where L(t) is uniformly upper and 



lower bounded by functions L max (ri), L m i n (n) respectively 
(that can potentially scale with the network size ri). Finally, 
we define the finish time or spreading time of the epidemic 
as T = inf{i > : S(t) = 1„}, i.e., the time at which all 
nodes in V get infected. Our concerns are both to (a) analyze 
the finish time under certain natural long-range spreading 
dynamics, and (b) show universal lower-bounds on the finish 
time for common structured networks, over a wide class of 
long-range spreading dynamics. 

General Notation: We use Z and K for the set of integers 
and reals respectively, and also use standard Landau notation 
(O, 0, f2) for the asymptotic growth rate of functions. For 
random variables X and Y, the notation I< st y and Y > st 
X means that Y stochastically dominates X, i.e. P[Y > z] > 
V[X > z] for all z. Where necessary, we follow the convention 

that l/oo = 0. 

III. Main Results and Discussion 

We now state our main results, and discuss what they 
translate to for different models of local epidemics aided by 
spreading via external agents. 

We state our results for general external virulence L(t) 
(more specifically, in terms of the bounds L nlin (n) and 
L max (n) on ||i(t)||i) in two parts: upper bounds for spreading 
time for general graphs under specific policies (in particular, a 
random policy and a greedy policy), and lower bounds under 
any policy for certain specific graphs (in particular, rings/line 
graphs, <i-dimensional grids and the geometric random graph), 
which are spatially-constrained, and where the bounds are 
tight. We conclude the section with a discussion of the appli- 
cability of our bounds and techniques; in particular, we show 
how our bounds can be used to obtain results on spreading 
time for various models of external infection such as long- 
range links and mobile agents, and discuss their limitations. 

A. Upper Bounds for Specific Policies 

Our first main result is an upper-bound on the finish 
time (both in expectation and with high probability) of the 
homogeneous external virulence policy, or as we refer to it 
hereafter, the random spread policy, for a general graph G. 
Such a policy is equivalent to one in which the (single) external 
agent chooses a node uniformly at random and starts infecting 
it; hence the name 'random spreading policy'. The following 
result states that if G can be broken into a (large) number 
of uniformly-sized pieces, then the time taken by random 
spreading to finish is of the order of the number of pieces 
or the piece size, whichever dominates. 

Theorem 1 (Upper bound for Homogeneous External Vir- 
ulence: Diameter version). Suppose Li(t) — Lj(t), i,j = 
l,...,n, and ||L(t)||i > L m - U1 (n) > for all t > 0. 
Suppose also that for each n, the graph G n admits a partition 
G n = Ui£i G n .i by g(n) connected subgraphs G n ^, each 
with size 0(s(n)) and diameter 0{d(n)). Then, 
(a) (Mean finish time) E[T] = 0{h(n) logn), where h(n) = 

max (r~W'^ n ))- 
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(b) (Finish time concentration) If g(n) = Q(n s ) for some 
8 > 0, then for any 7 > there exists n = k(j) > 
such that P[T > nh(n) logn] = 0(n~ 7 ). 

To understand how this result is applied, consider a line 
graph on n nodes-this can be partitioned into y/n segments of 
length (diameter) y/n each, and hence by the above result, the 
random spreading policy takes 0(^/n\ogn) time to infect all 
nodes. We formally state and derive such results subsequently. 

Next we obtain a spreading time bound for a greedy spread- 
ing policy, which we call the Greedy Subgraph Infection (or 
GSI) policy. The policy is based on the (optimal) partitioning 
of the graph that we constructed in the above theorems, and 
is as follows: given the subgraphs Gi,i E {1, 2, . . . , g(n)}, 
they are infected through sequential greedy (as opposed to 
homogeneous) long-range contact, i.e., ||L(i)||i = L m i n (n), 
and L(t) is supported on a single node j(t) within any max- 
imally healthy subgraph at time t (i.e., one which minimizes 
\G l n The finish time of the GSI policy is 0(h(ri)) in 

expectation and w.h.p., which we state as follows: 

Theorem 2 (Upper bound for GSI Policy). Suppose for each 
n, the graph G n admits partition Ufl"^ G njl of connected 
subgraphs G n ^, each of size Q(s(n)) and diameter 0(d(n)). 
Further, d(n) — logn + o;(l). Then for spreading via the 
Greedy Subgraph Infection policy, we have E[T] = 0{h{n)), 
where h(n) = max (^ L 9 ^ n ^ , d(n)j. 

Again, applying this to the line graph with n nodes, we 
now get a spreading time of 0(-<Jn), which improves on the 
previous bound by a factor of log n. 

Our final upper bound is an alternate bound for the finish 
time with random external-agents in terms of a different struc- 
tural property intimately related to spreading ability in graphs 
- the conductance (also called the isoperimetric constant). The 
conductance 9(G) of a graph G — (V,E) is defined as 

E(S, V \ S) 

\s\ ' 



*(G) 



A 



inf 

S<ZV:KS< 



|V| 
— 2 



where for A, B C V, E(A. B) denotes the number of 
edges that have exactly one endpoint each in A and B. The 
conductance of a graph is a widely studied measure of how 
fast a random walk on the graph converges to stationarity [34|, 
1 29 1. Analogous to Theorem [T] the next result formalizes the 
idea that spreading on a graph is dominated by the larger of 
(a) the number of pieces it can be broken into, and (b) the 
reciprocal of the piece conductance. 

Theorem 3 (Upper Bound for Homogeneous External Vir- 
ulence: Conductance version). Suppose that for each n, the 
graph G n admits a partition G n — IJfl™'' G n ^ by g(n) 
connected subgraphs G n j, each with size 0(s(n)) and con- 
ductance Q(9(n)), Then, 

(a) (Mean finish time) E[T ff ] — 0(k(n) log g{n)), where 

(b) (Finish time concentration) There exists k > indepen- 
dent of n such that 



P[T, r > Kk(n) log g(n)} = 0( {log g(n)) 



B. Results: Lower Bounds for Specific Topologies 

Having estimated the spreading time of random and greedy 
external-infection policies, a natural question that arises at 
this point is: How do these policies compare with the best 
(possibly omniscient and adversarial) policy, i.e., with the 
lowest possible spreading time among all other infection 
strategies! To this end, we show that for certain commonly 
studied spatially-limited networks (i.e., with diameter S](n") 
for some a > 0), such as line/ring networks, d-dimensional 
grids and random geometric graphs, random spreading yields 
the best order-wise time up to a logarithmic factor (and the 
GSI policy yields the best order-wise time) to spread infection. 
In particular, for each of these classes of graphs, we establish 
lower bounds on the finish time of any spreading strategy, that 
match the upper bounds established in the previous section. 
Rings/Linear Graphs: Let G n = (V n ,E n ) be the ring 
graph with n contiguous vertices V n = {vi, . . . ,v n }, E n = 
{(vi,Vj) : j — i = 1 (mod n)}. By partitioning G n into v^Q 
successive -^/n-sized segments, where the diameter of each 
segment is y/n, an application of Theorem [T| gives: 

Corollary 1 (Time for random spread on ring graphs). For 

the random spread policy n r on the ring/line graph G n , we 
have 



(a) E[T ffr ] = O 



>(«) 



logn 



(b) For any 7 > 3a = a("f) > such that 
O(n-T). 



An) 



log n 



i.e., the finish time on an n-ring, with random mobility, is 
0{y/n\ogn) in expectation and with high probability. 

Our next main result is the following lower bound for the 
spreading time of an epidemic on ring graphs due to any 
external-infection policy. 

Theorem 4 (Lower bound for ring graphs). For the ring graph 
G n with n nodes, and given that ||L(t)||x < lVt > 0, there 
exists c > independent of n such that for any spreading 
policy 7T, 

P [Tjr < cy/n] = O (e-OMv^ 
Moreover, we have 

inf E[TJ = fl(Jn) . 
wen v ' 

(i-dimensional Grids: Building on the previous result, we next 
show that the random spread strategy achieves the order-wise 
optimal finish time even on d-dimensional grid networks where 
d > 2. Given d, the (i-dimensional grid graph G n = (V n ,E n ) 
on n nodes is given by V n — {1,2,..., n 1 / d } d , and E n = 
{(x,y)eV n xV n :\\x-y\\ 1 = l}. 

Consider a partition of G n into (n/L m i n ) 1 /' d+1 ^ identical 
and contiguous 'sub-grids' G n ,i, i = 1, . . . , n 1 /( d+1 ^ ) (for 
details, refer to Section \V-B\ . With such a partition, an 
application of Theorem [T] shows that 



'For ease of notation, we assume that fractional powers of n take integer 
value; if not, the bounds can be modified by appropriately taking ceiling/floor. 
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Corollary 2 (Time for random spread on d-dimensional grids). 
For the random spread policy ir r on an n-node d- dimensional 
grid G n , we have 

(a) E[T 7rr ] = 0(( Im ^ R J logn] 



(b) For any 7 > there exists a 



a (7) > with . 



T~ > 



logn = 0(n 7 ). 



i.e., the finish time with random external-infection on a d- 

( V \V(<*+i) \ 

dimensional n-node grid is Oil L - ^ J log n I in 

expectation and with high probability. 

Further we show that any external-infection policy on a grid 

// vl/(d+l)\ 

must take time ft I ( L J I to finish infecting all 

nodes with high probability, and consequently also in expec- 
tation, thereby showing the above bound is order-optimal. 



Theorem 5 (Lower bound: d-dimensional grids, bounded 
long-range virulence). Let G n be a symmetric n-node d- 
dimensional grid graph. Suppose that ||L(i)||i < L max (n) = 
uj(n) for all t > 0. Then, there exist C\,C2 > 0, not depending 
on n, such that 



T < ci 



= o 



-c 2 (-, 



Furthermore, if L max (n) — 0(n l e ) for some e £ (0, 1], then 

1 \ 

d+l \ 



E[T] = Q 



Geometric Random Graphs: Finally, we shift focus from 
structured graphs to a popular family of random graphs, widely 
used for modelling physical networks. The Geometric Random 
Graph (RGG) is a random graph model wherein n points (i.e. 
nodes) are placed i.i.d. uniformly in [0, 1] x [0, 1]. Two nodes 
x,y are connected by an edge iff ||x — y|| < r n , where r n 
is often called the coverage radius. The RGG G„ = G n (r n ) 
consists of the n nodes and edges as above. 

It is known that when the coverage radius r n is above a 
critical threshold of Wlogn/n, the RGG is connected with 
high probability (35] - In our l ast set of results, we show that 
similar to before, random spreading on RGGs in this critical 
connectivity regime is optimal upto logarithmic factors. First, 
we show with high probability that random spreading finishes 
in time O(^nlogn): 

Theorem 6 (Time for random spread on RGGs). For the 

planar random geometric graph G n (r n ), if r n > - 1 5 lQg " 
then there exists a > such that 

w r > a y/n/L min (n) log n] = 0. 



lim 

n— >oo 

Finally, we follow this up with a converse result that shows 
that no other policy can better this time (order-wise, up to the 
logarithmic factor) with significant probability. This directly 
parallels the earlier results about finish times on 2-dimensional 
grids, where random mobile spread exhibits the same optimal 
order of growth. 



Theorem 7 (Lower bound for RGGs). For the planar geo- 
metric random graph G n with r n — 0(y / logn/n) and any 
spreading policy ir with L max (n) = 0(n 1_c ) for some e G 



«(«) 



(0, 1], 3 j3 > such that lim„^o, 
1. 

C. Discussion and Extensions 

The framework of epidemic spreading with external agents 
encompasses many known models for epidemic spreading with 
long-range contacts (as we discussed previously in Section 
|n} : this is done by appropriately specifying L(t) e H.|^' 
as a function of time t, network topology and network-state 
S(t). For example, the presence of a single additional 'static 
long-range' link £ V 2 is equivalent to setting Lj(t) = 
plsM^Ljit) = /31ft(t)=i and L k (t) = OVfc $ {i,j} 
(where j3 is the rate of spreading on the edge). We now discuss 
the implications of our results and techniques on such models 
of long-range contact. 

Static Links: To demonstrate our results in the context of 
a graph overladen with additional static edges, consider a 
c?-dimensional grid with L(n) additional static links. Then 
we have the following lower-bound for the spreading time T 
(obtained by setting L max {n) = L(n) in Theorem 

Corollary 3. Let G n be a symmetric n-node d-dimensional 
grid graph, with L(n) additional static links. If L{n) = 



Oin 1 -') for some e e (0, 1], then E[T] = Q 



L(n) 



Note that by combining this with Theorem |2j we can also 
get the same lower bound on the diameter D(n) of the resul- 
tant graph. To see this, observe that by considering the entire 
graph as a single partition, Theorem [2] gives that the spreading 
time is O (D(n)), and thus D(n) = (^(n/L(n))^j by the 
above Corollary. One consequence of this is in the context 
of 'small-world graphs' (32), ||33l wherein the diameter of 
a d-dimensional grid on n nodes is reduced to 0(logn) by 
adding Q,(n\ogn) random long-range edges. The usefulness 
of the above result is to show that it is not possible to obtain 
such sub-polynomial diameters by adding 0(rt 1_e ) edges. 

We note also that this bound is tight. We can see this from 
the following simple example: partition the graph into L(n) 
identical segments, and add an edge between a chosen vertex 
i and a single vertex in each segment. Now for an epidemic 
starting at node i, it is easy to see that the resultant process 
is equivalent to the 2-phase spreading process in the proof 
of Theorem [2] (i.e., parallel seeding of clusters followed by 
local spreading in clusters). Hence, the spreading time for this 
process is O ^(n/L(n)) 3 + T ^ . 

Dynamic Links and Mobile Agents: A more surprising result 
is obtained by considering spreading on a grid with additional 
dynamic links, i.e., long-range links which can change their 
endpoints as time progresses. Unlike a static link which can 
transmit the infection only once (before both its endpoints are 
infected), such dynamic links can be re-used over time to help 
spread the infection. However, we can again use Theorem [5] 
to get the following 
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Corollary 4. Let G n be a symmetric n-node d- dimensional 
grid graph, with L = 0(n 1_e ), e G (0, 1] additional dynamic 



links. Then E[T] = ft 



Thus, using dynamic links does not reduce the order of the 
spreading time. 

Related to dynamic links are models of epidemic spread- 
ing via mobile agents-in such a context, assuming L(n) 
mobile agents, each with constant infection-rate, Theorem 
[5] again gives the same converse for spreading time, i.e., 
n ^(n/L(n)) 3 + T ^ for <i-dimensional grids. Furthermore, the 
techniques of Theorems [T] and [2] can be used to give up- 
per bounds for various models of mobility: for example, 
for L mobiles moving randomly on a d-dimensional grid 
(where each mobile is unconstrained by the graph as to its 
next location), Theorem [T] shows that the spreading time is 
O ((n/L(n))^y 

IV. Spreading-Time Bounds for Specific Policies 
In this section, we formally prove the upper bounds on 



spreading time we stated in Section III-A We first prove the 
diameter-based upper-bound on the finish time of the random 
spread policy for a general graph G. 

Theorem (Theorem [T] Upper bound for Homogeneous Ex- 
ternal Virulence: Diameter version). Suppose Li(t) — Lj(t), 
i,j = 1, . . . ,n, and \\L(t)\\i > L m i n (n) > for all t > 0. 
Suppose also that for each n, the graph G n admits a partition 
G n — Ufi.™ G Ut i by g(n) connected subgraphs G n ^, each 
with size 0(s(rt)) and diameter 0(d(n)). Then, 

(a) (Mean finish time) E[T] = 0(h(n) logn), where h(n) = 

max (mb' d ( n ))- 

(b) (Finish time concentration) If g(n) — fl(n s ) for some 
8 > 0, then for any 7 > there exists k = ^(7) > 
such that P[T > nh(n) logn] = 0(n" 7 ). 

In other words, given any partition of a large graph, the 
spreading time of an epidemic process, assisted by random 
external infection, is determined by both (a) the time taken 
for spread to start in each segment of the partition and (b) the 
worst possible time taken by the intrinsic spread within each 
segment. 

Proof: The proof uses techniques from (a) stochastic 
majorization and (b) graph-partitioning into suitable shortest- 
path spanning trees. 

Under the homogeneous (or random) external infection 
policy, we have that Li(t) > L m ; n (n)/n for all i = 1, . . . , n. 
As before, (S(t))t>o denotes the infection state process. Now 
observe that each subgraph G n .i = Gi is prone to infection 
(i.e., some node in G n j contracts infection) due to long-range 
contact with an exponential rate of fl (L m i n (n) ■ s(n)/n). 
Consider an alternative infection-spreading process (S(t)) t > 
which evolves in two phases: 

• Phase-1: Throughout this phase, infection spread oc- 
curs only due to long-range contact, and not through 
basic/static contact. The phase starts from time t = 
and ends when at least one node in each subgraph Gi is 
infected. Let T\ be the end time of this phase. 



• Phase-2: In this phase, infection spread occurs only due 
to basic contact through neighbours in G, and not through 
long -rang contact. At time T\, for each subgraph G,;, only 
the first node infected in phase-1, say iV,, is assumed to 
be infected, and all other nodes in Gi are considered 
healthy, even if some of them were infected on phase- 
1 subsequent to iV,. The process S(-) proceeds from 
time Ti onwards by the usual static spread dynamics 
within each Gi, i.e. with the restriction that infection does 
not spread across edges that connect different subgraphs. 
Denote by T2 the additional time taken (since Ti) for all 
nodes in all the G, to get infected. 

A standard coupling argument establishes that Af(S(t)) 
stochastically dominates JV(S(t)) for all t, i.e., S is a 'slower' 
process than S. Thus, the finish time for S(-) stochastically 
dominates that of S(-), i.e. 



(1) 



It remains to estimate the means of T\ and T2, and their 
tail probabilities, to finish the proof. The analysis for T\ fol- 
lows a standard coupon-collecting argument: memorylessness 
of the exponential distribution implies that T\ is stochasti- 
cally dominated by the maximum of g(n) i.i.d. exponential 
random variables with parameter Vt(L m i n (n) ■ s(n)/n), i.e., 
^(^min("-) /g{ n ))- Hence, using a well-known result about the 
expectation of the maximum of i.i.d. exponentials, we obtain 



E[Ti] = O 



H, 



L min (n)/g(n), 
0(g(n) log g{n)/L min (n)), 



(2) 



A 



where = X)i=i* _1 = G(logfc) is the /cth harmonic 
number. Also, by a union bound over the tails of g(n) i.i.d. 
exponential random variables, for any k > we can estimate 
the tail of T x : 

V[Ti>ng(n) log g(n)\ 

< 5(n)e~^ e ^ imin ^ I ^ / '9( n )) K 9(n) log g(n)) 

= 5 ( n )-e(«Wn))+i. (3) 

To estimate the statistics of T 2 , we further consider the 
following 'slower' mode of (static) spread than that of phase- 
2: for each subgraph Gi (with diameter 0(d(n))), let W{ be a 
shortest-path spanning tree of Gi rooted at the node Ni which 
is infected in phase-1. Such a tree has diameter 0(d(n))) and 
can, in principle, be obtained by performing a Breadth-First 
Search (BFS) on Gi starting at Ni. If we now insist that the 
phase-2 static infection process in Gi spreads only via the 
edges of Wi, then again, a standard coupling can be used to 
show that the time T2 when all nodes in G get infected thus 
stochastically dominates T 2 . 

Before proceeding, we need the following simple lemma, 
which we state without proof: 

Lemma 1. For real numbers aij, 1 < i < m, 1 < j < n, 

max™ x YT S =i aij < £"=1 max,™ 1 a ir 

Now for each tree W%, let its leaves be labelled 
Nn, ■ ■ ■ ,Nn(iy Each leaf has a unique path pij starting 
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from Ni to itself, of length 0(d(n)). Let Tjk be the time taken 
for the infection to spread across the fcth edge on this path pij, 
i.e. the (exponentially distributed) interval between the times 
when the (k — l)th node and the kth node on the path are 
infected. Then, the time Tjj.t taken for all nodes in Wi (hence 
Gi) to get infected can be upper-bounded by using Lemma [T] 

0(d(n)) 



, s \Pii\ 

J-jk 

t 



E 

fe=i 



max ljk 



and a further application of the lemma bounds the phase-2 
finish time T 2 = maxfi." as 



. o(d(n)) , 
^ . 9(n) \ - / ^ 
J? < max > max Jot 

fc=l v 

0(d(n)) 

< > maxmaxJj). 
fe=i ^ 1-1 J ' =1 

The term in brackets is simply the maximum of the infection 
spread times across all stage-fc edges of all the trees Wi within 
G. Hence, it is stochastically bounded above by the maximum 
of n i.i.d Exponential 1) random variables (say Z\, . . . , Z n ), 
using which we can write 



E[T 2 ] < E[f 2 ] < 



0(d(n)) 

E 

fc=l 



0(H n ) = 0(d(n)logn). (4) 



Again, using the union bound to estimate the tail probability 
of T 2 , we have, for any k > 0, 

P[T 2 > Kd(n) log n] < P[f 2 > nd{n) log n] 

< 0{d(n))¥[Z 1 > Klogn] 
<n-ne- Klosn =n- K+2 . (5) 

We now have all the required pieces. Combining ([T]), (|2]) 
and Q with the fact that g(n) = 0(n) proves the first part of 
the theorem. For the second part, the hypothesis that g(n) = 
Q(n s ), together with (gj, gives 

P[Ti > Kh(n)logn) < P[r x > Kg[n)logg(n)] 

which, together with Q and |5]), gives 

P[T^ r > 2nh{n) logn] < ¥[T X +T 2 > 2nh(n) logn] 
< n -5e( K L min (™))+<5 + n -«+2 

Choosing k s.t. min{<5 (®(K,L m i n (n)) — 1), n — 2} > 7 yields 
the bound in the second part of the theorem. ■ 
The factor of logn in the bound of the above theorem is 
actually only due to the 'coupon-collector' effect phase- 1 time 
T\\ a more refined analysis of the phase-2 time T 2 shows that 
if d(n) = logn + w(l), i.e. the piece diameter is sufficiently 
large, then T 2 is order-wise d(n) in expectation and w.h.p. 
This is the intuition behind the spreading time bound for the 
Greedy Subgraph Infection policy. Recall the GSI policy was 
defined as follows: given the subgraphs Gi, they are infected 
through sequential greedy (as opposed to homogeneous) long- 



range contact, i.e., ||i(<)||i = L ni i n (n), and L(t) is supported 
on a single node j(t) within any maximally healthy subgraph 
at time t (i.e., one which minimizes | DiS(£)|). Then we 
have: 

Theorem (Theorem |2j Upper bound for GSI Policy). Suppose 
for each n, the graph G n admits partition Uf=i @n,i °f 
connected subgraphs G 7lt i, each of size Q(s(n)) and diameter 
0(d(n)). Further, d(n) — logn + uj(1). Then for spreading 
via the Greedy Subgraph Infection policy, we have E[T] = 
0(h(n)), where h(n) = max ( j, 9 f"( n ) > d(n)J. 

Proof: Using the same notation as the earlier proof, we 
consider the slower, two-phase spreading process, such that 
T < s t Ti + T 2 : in this case however, phase-1 consists of a 
sequential 'seeding' of each subgraph (it is clear that this is 
stochastically dominated by the greedy subgraph infection). 
Thus T\ now corresponds to the sum of gin) i.i.d exponential 
random variables with parameter £l(L m i n (n)) (i.e., there is no 
coupon-collector effect), and thus, via standard results, con- 
centrates around around its mean which is 0(g{ri) / L m [ n (n)). 
To complete the proof, we need to tighten our previous bound 
for T 2 (and hence, T 2 ), which, using our previous notation, 
can be written as: 



rf, ff(") rf, ff(") ^ ' 

1 2 = max 1 2 i = max max > Jot, 

2 = 1 ' 2 = 1 1 = 1 ^— ' J 

k=l 

i.e., T 2 is the maximum sum of infection times over all leaves 
in all trees Wi. Since the total number of leaves in all the trees 
Wi is at most n, a union bound yields, for any a > 0, 



P[f 2 > ad(n)} < nP 



d(n) 

Zi > ad(n) 



where all the Zj are independent Exponential 1) random 
variables. A Chernoff bounding technique yields 



d(n) 

Zi > ad(n) 



where < ip < 1. With ip = 1/2 and any a > 0, we have, 

P[f 2 > ad(n)] < n ■ 2 di -^ e ~^. 
Finally, for estimating E[T 2 ] we have, 



EfTo 



P[f 2 > x]dx 



< (21og2 + 2)d(n) 

+ d[n) / P[T 2 > ad(n)]da. 

J 2 log 2+2 

< 3d(n) + Tnd(n) I e-^da 

J 2 log 2+2 

= U{n) + 2ne~ d(n) = 0(d{n)) 
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Thus we have E[T] = O ( max ( L 9 ^\ n) , d(n) 



We conclude this section with the conductance-based upper 
bound on the finish time with random external-agents. 

Theorem (Theorem [3] Upper Bound for Homogeneous Exter- 
nal Virulence: Conductance version). Suppose that for each 
n, the graph G n admits a partition G n — Uii? Gn.i by 
g(n) connected subgraphs G n ^, each with size 0(s(n)) and 
conductance 0(^(n)). Then, 



(a) (Mean finish time) K[T n 



k(n) 



0(fc(n) log g(ri)), where 



n) logs(n) 



,(n)' *(n) 

(b) (Finish time concentration) There exists k > indepen- 
dent of n such that 

P[T» r > Kk(n)logg(n)] = O ((log<?(n))~ 

Proof: As in Theorem [T] we study an associated two- 
phase spreading process (S(t))t>o, where the first phase takes 
time T\ to infect at least one node in each Gi, and the infection 
takes a further time T 2 to spread within every (connected) Gi. 
A coupling argument establishes that T„ r < st T\ + T 2 . 

As before, T\ is distributed as the maximum of g(n) 
Exponential(6(L m ; n (n)/(7(n))) random variables, and stan- 
dard results yield (for n > 0): 



E[7\] = 0{g(n)logg(n)/L min (n)), 



(6) 



and for the variance: 



Var[Ti] = 



6(U(») a /ff(n) J ) S 1 



s(«) ^ 
I2 



9 (g(n) 2 /L min (nf 



(7) 



Next we have that T 2 is the maximum of the times T 2 ,i for 
infection to spread in each subgraph Gi. We stochastically 
dominate each T 2 ^ as follows: for each subgraph Gi, consider 
a continuous time Markov chain (Zi) t >o on the state space 
1, . . . , |V(Gj)| with Zi(0) = 1 and transitions j j+l at rate 
if 1 < j < \V(Gi)\/2, and at rate (|V(Gi)| - j)*(n) 
if \V(Gi)\/2 < j < \V(Gi)\ - 1. Let fa,, be the time taken 
for the Markov chain Z l to hit its final state |F(Gj)|; f 2 j = 



E 



|V(G 4 )|-1 



T%,%,j where T 2 ,i,j is the sojourn time of Z» in 



state j. We claim that stochastically dominates T 2 ^. To 
see this, note that at any time t, if the number of infected nodes 
in the phase-2 spreading process in Gi is 1 < j < \V(Gi)\/2, 
then by the definition of conductance, the rate at which a new 
healthy node in Gi is infected is at least j^(n). Similarly, if 
the number of infected nodes is \V(Gi)\/2 < j < |V(G»)| (i.e. 
the number of healthy nodes is (| V(Gi)| — j)), then the rate at 
which a new healthy node is infected is at least (|V"(Gj)|/2 — 
j)^(n). Hence, by standard Markov chain coupling arguments 
(e.g., see HI]), we have that T 2 ^ < st f 2 . t . 

By independence of the original phase-2 spreading pro- 



cesses within the Gi for all i = 1, . . . ,g(n), we have 

\V(G t )\-l _ 

T 2 = max T 2 ,% < s t max T 2<i = max } f 2ti j 



|V(G«)|-1 

< maxT2,i.j 
3=1 



Hence we have 

I^CGi)!-! 



max Tc 



2,i,j 



E[T 2 ] < E 

' log s(n) log fli(n) 



|V(G0l/2. , . 

2 loggW 



*(n) 



(8) 



Also, 




inc?0l-i 

Var fmaxTa^j 



= 6 



(9) 



Combining |6]l and ([8]l gives the first part of the theorem. For 
the second part, we have 



P[T„., > «fc(n) log ^(n)] < Ppi + T 2 > «A;(n) log 5(71)] 
< 



m „ re / / s log s(n)\ , , , 
r 1+ T a >-^(n) + -|^Jbgi,(n) 

Now, using the variance estimates |7]l and |9]) with Cheby- 
shev's inequality, we have for large enough k > 

P[T„ r > nk(n) log. 9 (n)] 

Var^+E^f^'-'max,^,^) 



< 



log 2 5 (n) ( 5 (n) + 1 ^ 

/ g(n) 2 , 1 

£min(n) 2 *(n) 2 

v log 2 .g(n) + 



= 0^(log 5 (n))- 
This completes the proof. 



V. Lower Bounds for Specific Graphs 

In the previous section, we have estimated the time taken 
by random (and greedy) external-infection policies to infect all 
nodes in a network. In this section, we derive corresponding 
lower bounds for certain commonly studied spatially limited 
networks, in particular, line/ring networks, d-dimensional grids 



and random geometric graphs. As discussed in Section III-B 



for each of these classes of graphs, we establish lower bounds 
on the finish time of any spreading strategy (possibly om- 
niscient and adversarial) that match the upper bounds (upto 
logarithmic factors for random spread, and exactly for the GSI 
policy). 
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A. Ring/Linear Graphs 

As before, for each n we define G n — (V n ,E n ) to be the 
ring graph with n contiguous vertices V n — {vi,...,v n }, 
E n — {(vi,Vj) : j — i = 1 (mod n)} . In the con- 
text of Theorem [l] let us partition G n into y/n successive 
•y/n-sized segments, i.e. G n; i is the subgraph induced by 

Wti-ijVn+D • • • ) v i^m where * ran g es from !> • • • > V™- The 
diameter of each segment is %fn, and a straightforward appli- 
cation of the theorem gives 

Corollary (Corollary [2] Time for random spread on ring 
graphs). For the random spread policy ix r on the ring/line 
graph G n , we have 

(a) E[T„ r 



°(V™ logn > 

(b) For any 7 > 3a = a (7) > such that 
^logn] =0(n"T). 



i.e., the finish time on an n-ring, with random mobility, is 
0(\/n log n) m expectation and with high probability. 

We now prove that the finish time on a grid or line graph 
with any (possibly infection-state aware) external-infection 
spread strategy must be J7(y / n), both in expectation and with 
high probability. This establishes that for ring graphs (or 1- 
dimensional grids), random external-infection is as good as 
any other form of controlled infection in an order-wise (up 
to a logarithm) sense. Furthermore, we use this theorem to 
introduce a general technique for obtaining lower bounds 
based on stochastic dominance via a parallel cluster-growing 
process. For ease of notation, we assume ||Z(f)||i < 1 in this 
proof-in the next section, we obtain the more general bound 
(with dependence on L max (n)) for c?-dimensional grids. 

Theorem (Theorem [4j Lower bound for ring graphs). For 

the ring graph G n with n nodes, and given that \\L(t)\\i < 
1 Vt > 0, there exists c > independent of n such that for 
any spreading policy 7T, 

P [T n < c«/n] =0(e 

Moreover, we have 



-e(l)Vn 



inf E[TJ =n(y/n). 

Proof: To keep the proof general, we use a parameter 
(3 for the intrinsic spreading rate over an edge (assumed to 
be 1 earlier). Along with the spreading process (S ,r (i)) t >n 
induced by the policy tt, consider a random process (S(t)) t >o 
described as follows: 

(a) At all times t, S(t) consists of an integer number (C t ) of 
sets of points called clusters, where (Ct)t>o is a Poisson 
process with intensity L max {n) = 1, and Cq — 1 (the 1 
denotes an 'initial' cluster in which static infection starts 
spreading). 

(b) Once a new cluster is formed at some time s, it grows, 
i.e. adds points, following a Poisson process of intensity 
2/3 (recall j3 is the intrinsic spreading rate for an edge 
in the graph). 

Via a coupling argument, it can be shown that for all 



spreading strategies tt 6 IT, at all times t > 0, the total number 
of points in S(t) (denoted by N t ) stochastically dominates that 
in S v (t) (informally, this is due to two reasons: first, that the 
rate of 'seeding' of new clusters by tt is at most as fast as that 
in £>(■); secondly, each cluster in S(-) grows independently and 
without interference from other existing clusters, as opposed 
to clusters that could 'merge' in the process 5' 7r (-)). Figure 
[T] graphically depicts the structure of the dominating process 
S(-). 



Mobile agent 
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t = t, 



Origin 2/? 



IP 
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Fig. 1 . Dominating the infection spread using independently growing clusters 

Let f = inf{i > : N t = n} be the time when the 
number of points in §(■) first hits n. Owing to the stochastic 
dominance J\f(S n (t)) < s t Nu we nave tnat 



t < st t„ Vtt e n. 

Knowing the way S(-) evolves, we can calculate E[N t ] 
E[N t ]=E[E[N t \C t }} 

OO 

= J2nCt = k)E[N t \C t = k] 



(10) 



k=0 

^ e-H k 



k=0 



E[N t \C t = k}. 



Since Ct is a Poisson process, conditioned on {Ct = k}, the 
k cluster-creation instants are distributed uniformly on [0,t]. 
Let the times of these arrivals be Ti, . . . , 2V, then [Tj, t] is the 
time for which the ith cluster has been growing. Since every 
cluster grows at a rate of 2f3, conditioned on {Ct = k}, the 
expected size of the ith cluster is 2/?(t — Tj), 1 < i < k. Also, 
the expected size of the '0-th' cluster at time t is 2j3t. Using 
E[fi\C t =k] = t/2, we obtain 



k 



E[N t \C t — k] — 2/3< + > E[2p(t - fy\C f = k] 



i=l 



P(k + 2)t 



E[N t } = J2^iPnNt\Ct = k} 



uk 



k=0 



- e~H k 



E 



A-! 

Pi 2 + 2/3t 



I3{k + 2)t 



k=0 
+2 



to 



Hence, we have 



A 



i 1 '*}*, and E n = {(x,y) € 



P(f >t) = P(N t < n) = 1 - P(N t > n) 
E[iV t l 1 /3(i + l) 2 



> 1 



> 1 



n n 
by an application of Markov's inequality. This means that 



E[T] = / P(T > > 



P(T > 



> 



o 

2 /n 



3V/3 
Q(Vn) 



- 1 



3n 2 



Together with (10 1, this forces inf w6 n E[T ff ] = 17 (-^/n), and 
the first part of the theorem is proved. For the second part, if 
we denote the size of the ith created cluster at time s > Ti 
by Xi(s), then for any time t we can write 

(f){Mt + Ti) < 4e/3i} ) f]{C t < 2et} 



\i=0 



'C{t) 



C \ f]{X t (t + T t ) < AefJt} f]{C t < 2et} 



i=0 



C ( f]{X t (t) < 4e(3t}\ f]{C t < 2et} 

C {N t < 8/3e 2 i 2 }. 

Applying a standard Chernoff bound (P[Y > 2eA] < (2e)~ A 
for y ~ Poisson(A)) to C t ~ Poisson(Q and + Ti) ~ 
Poisson(2/3i) above, we can write 



2et 



F[N t > 8(3e 2 t 2 } 

< P[C t > 2et] + F [ x i{t + T i) ^ 4e ^] 

< (2e)- f + 2ei • (2e)- 2/3 * 
= 0(e-*( 1A2 «). 



In conclusion, using the stochastic dominance (lOi: 

P i; < , ? : ■ 



T < 
N r 



> n 



8/3e 







B. d-Dimensional Grid Graphs 

Extending the previous result, this section shows that the 
simple, state-oblivious random external-infection spreading 
strategy achieves the optimal order-wise finish time on d- 
dimensional grid networks for d > 2. For such a dimension 
d, the (i-dimensional grid graph G n — (V n ,E n ) on n nodes 



is given by V„ = {1, 2, . . 
V n X V n : \\x- 2/j | x = 1}. 

Consider a partition of G„ into (n/L m i n ) 1 ^ d+1 ' ) identical 
and contiguous 'sub-grids' G n ,u i = 1, . . . ,n 1 ^ d+1 \ By 
this, we mean that each G n i is induced by a copy of 
{1,2,..., (n/L min ) 1 /( d+1 '>} d (and thus has {n/ L m{n ) d /^+i) 
nodes). For instance, in the case of a planar %fn x %fn grid 
(with L m i n = 1), imagine tiling it horizontally and vertically 
with {/n identical ^fn x {/n sub-grids (Figure gj. With such 
a partition, an application of Theorem [T] shows that 

Corollary (Corollary |2j Time for random spread on d-grids). 
For the random spread policy ir r on an n-node d-dimensional 
grid G n , we have 

(a) E[T nr ] = 0(( J ^ nr) 



(b) For any 7 > there exists a 



log nj 
a(j) > with 



log n 



i.e., the finish time with random external-infection on a d- 
dimensional n-node grid is Oil ^ ™ ( n ) ) 1°§ n > m 

expectation and with high probability. 
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Fig. 2. Partitioning a planar grid into sub-grids 



In what follows, we show that any external-infection spread 

( f x i/(rf+r~ 

ing policy on a grid must take time fl 



to finish infecting all nodes with high probability, and con 
sequently also in expectation. Barring a logarithmic factor, 
this shows that such a a random policy is as good as any 
other (possibly state-aware) policy for the class of grids. In 
order to derive this lower bound, we first need the following 
lemma from the theory of first-passage percolation ifTTl . which 
essentially lets us control the extent to which infection on an 
infinite grid has spread at time t: 

Lemma 2. Let (Z(t)) t >o G {0, 1} Z represent a static/basic 
infection spread process on the infinite d-dimensional lattice 
Z d starting at node (0, 0, ... , 0) at time 0. Then, there exist 
positive constants l,c 3 ,C4 such that for t > 1, 

V[Af(Z(t)) > t d l d ] < cit 2d e- C2V? . 

Proof of Lemma |2j Let 

B(t) = {ve Z d : Z v {t) = 1} C 1 d (C R d ) 

be the set of infected nodes at time t in Z. We use the 
following version of a result, from percolation on lattices 
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with exponentially distributed edge passage times, about the 
'typical shape' of B(t) ifTTll : 

(Theorem 2 in 4771/ ) There exists a fixed (I.e. not depending 
on t) cube Bo — [— 5, 4] C M d , anc/ constants c\,c% > 0, 
5Mc/z that for t > 1, 



P B(<) C iB > 1 - cit 2d e- C2v/ *. 
It follows from {FT} that for t > 1, 

P[AA(Z(t)) > t<¥] = F[\B(t)\ > t d l d ] 



(11) 



2d-c 2 Vi 



< ¥[B(t) tB ] < at 



Lemma [3] allows us to control the growth of individual 
seeded clusters of infected nodes; this is analogous to the use 
of the dominating spread process (growing at rate 2/3) for line 
graphs. We thus obtain the following lower bound. 

Theorem (Theorem [5] Lower bound: d-dimensional grids, 
bounded long-range virulence). Let G n be a symmetric n- 
node d-dimensional grid graph. Suppose that < 
I max (ri) = u}(n) for all t > 0. Then, there exist c\,C2 > 0, 
not depending on n, such that 



T < ci 



o 



Furthermore, if L mffl (n) = 0(n l e ) for some t £ (0, 1], then 
\\L max (n) 

Proof: Let us introduce a (dominating) counting process 
(S(t))t>o, described as follows: 

• yt > 0, S(t) consists of an integer number (C t ) of sets of 
points called clusters, where (Ct)t>o is a Poisson process 
with intensity L meix (n), and Co = 1 (denoting an 'initial' 
infected node). 

• Each cluster grows as an independent copy of a static 
infection process on an exclusive infinite <i-dimensional 
grid Z d starting at (0,0,..., 0). 
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Fig. 3. The grid graph: Coupling infection spreading with mobility to a 
dominating 'cluster-growth' process 

Note that in the process S, the growth of each cluster follows 
the natural static infection dynamics within a <i-dimensional 
grid graph. Again, a standard coupling argument shows that 
at all times t > 0, the total number of points in S(t) (denoted 



by N t ) stochastically dominates that in 5(t)-this is essentially 
due to (a) cluster 'seeding' at the highest possible net expo- 
nential rate L max (n), and (b) the absence of 'colliding' or 
multiple infections incident at any single node (Figure [3]). Let 
T = mi{t > : N t — n} be the time when the number of 
points in £>(■) first hits n. Then we have 



M(S(t)) < st N t => f < st T. 



(12) 



Let us denote by Xi(s) the size of the ith created cluster of 
S(-) at time s > T^. Then, for t > 0, we have 

^f]{Xt(t + T t ) < t d l d }^j f| ({C t < 2eL max (n)<}^ 

C {N t < 2eL max (n)l d t d+1 }, 

Now each random variable Xi(t + T,) is distributed as the 
number of infected nodes in a static infection process on an 
infinite grid at time t. Thus, using Lemma [2] and a standard 
Chernoff bound for Ct ~ Poisson(tL max (n)), we can write 



N t > (2eL max (n)l d )t 



< P C t > 2eL max (n)t 



2eL max (n)t 



E 



d 1 d 



Xi(t + Ti)>t a l 



< (2e)- L — + 2eL max (n)t ■ c z t 2d e~^ 
= 0(L max (n) e - C4V *). 
With the stochastic dominance ( fl"2| >, this forces 



T < 



< 



2eL max (n)l d 
n 



T < 



N, 



2eL max (n)l d 



i/(d+i) 



i/(d+i) > n 



O 



y 2el. max (n)! a 



- Ci ( t m I M )' 



(13) 



for the appropriate c%, establishing the first part of the theorem. 
To see how this implies the second part, note that the estimate 
(13 1, together with the fact that L max (n) = 0(n 1_e ) and the 
Borel-Cantelli lemma, gives us 

i/(d+i) 



T < 



lim inf 



2eL rnax (n)l d 
f 



for finitely many n 
1 



™ (n/L max (n)) 1/{d - 
By Fatou's lemma, 

f 

lim inf E 



o.s. A 



( 2e ^)i/(d+i) 



1, 



> 



(n/L max (n)) 1/{d+1) 



> E 



T 

lim inf ... . 



(n/L max (n)) 



> c 4 > 0. 
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This shows that E[T] > E[T] = n ((n/L max (n)) 1/(d+i: 
and concludes the proof of the theorem. 



C. Geometric Random Graphs 

We finally prove the upper and lower bounds for the 
Geometric Random Graph (RGG). Recall that an RGG is a 
family of random graphs wherein n points (i.e. nodes) are 
picked i.i.d. uniformly in [0,1] x [0,1]. Two nodes x, y are 
connected by an edge iff ||x — y\\ < r n , where r n is often 
called the coverage radius. The RGG G n = G n (r n ) consists 
of the n nodes and edges as above. 

It is known that when the coverage radius r n is above a 
critical threshold of yJlogn/iT, the RGG is connected with 
high probability l35l . In this section, we state and prove two 
results that show that random spreading on RGGs in this 
critical connectivity regime is optimal upto logarithmic factors. 
First, we show with high probability that random spreading 
finishes in time O(tfnlogn), and follow it up with a converse 
result that says that no other policy can better this order (up 
to the logarithmic factor) with significant probability. This 
directly parallels the earlier results about finish times on 2- 
dimensional grids, where random mobile spread exhibits the 
same optimal order of growth. 

Theorem (Theorem [6] Time for random spread on RGGs). 
For the planar random geometric graph G n (r n ), if r n > 

51 ° s " , then there exists a > such that 

lim F[T nr > a?/n/L lrLin (n) logn] = 0. 

n— >oo 

Proof: Divide the unit square [0, 1] x [0, 1] into square tiles 
of side length r n /\/5 each; there are thus 5/r 2 such tiles, say 



fci , . . . , fc 5 i r i . If n points are thrown uniformly randomly into 
[0, 1] x [0, 1], then, with £ denoting the event that some tile 
is empty, 



P [£] < -5-P [tile 1 empty] 



1 - 



< 



exp 



nr r. 

5 



< 



logn 
1 

logn 



cxp(— log n) 



0. 



(14) 



By construction, note that the maximum distance between 
points in two horizontally or vertically adjacent tiles is exactly 
r n . Hence, two nodes in horizontally or vertically adjacent tiles 
are always connected by an edge. Also, a node in a tile is not 
connected to any node in a tile at least three tiles away in 
either dimension. 

If we now divide [0, 1] x [0, 1] into (bigger) square chunks 
of side length 1 



nL m i n (n) 2 each, there are 



V5 x VE 



r — 2 Vt such 
grid of square 



square chunks, each containing a r - 
tiles. In the case where no tile is empty, it follows from the 
arguments in the preceding paragraph that the diameter of the 



subgraph induced within each chunk is 



o 



o 



= o 



r n %JnL min (n) 2 
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since r n > logn. An application of Theorem [T] in this case 
shows that E [T OTr \£] — 0(yjn/ L m \ n (n) logn), and for some 

a, 7 > 0, P \T nr > a$/n/L min (n) logn | s] = O(n^). 



Using (14 1, we conclude that 

r nr > ai/n/L nlin {n) logn] = O ( ) n -±+ 0. 

\logn/ 

■ 

Consider an infinite planar grid with additional one-hop 
diagonal edges, i.e. G — (V, E) where V = Z 2 , E = {(x, y) e 
Z 2 : ||x — j/Hoo < 1}. Let an infection process (S(t)) t > 
start from G Z 2 at time according to the standard static 
spread dynamics, i.e. with each edge propagating infection at 
an exponential rate /i, and let I(t) denote the set of infected 
nodes at time t. The following key lemma helps control the 
size of I(t), i.e. the extent of infection at time t: 

Lemma 3. There exists c\ > such that for any c-i > and 
t large enough, 

P [3a; € I{t) : > (ci/x + c 2 )t] = 



O ((ci/i + c 2 )t ■ e~ C2t ) 



Proof: 



V[3x £ I(t) : \\x\loo > ct] 

IMU = [ct\,T{v) < t] 

P[3 a path r : -> v,T(r) < t] 



< ¥\3v E Z 2 
< 



E 



--[ct} 



Observe that for any v with H^Hco = [ct\ and any path 
of edges r from to v, there must exist \ct\ + 1 nodes 
vq — 0, v i, . . . , vy ct \ on the path r such that | \vi\ |oo < \ct\ and 
— ViWoo = 1. Indeed, each edge on a path can increase 
the 1 1 • | |oo distance from by at most 1. Therefore, continuing 
the above chain of inequalities, the sought probability is 
bounded above by 



^2 E P P a P ath r : 



v passing 



{u:|H| oo = |ctJ}«0,---,«|ot| 



successively through the Vi,T(r) < t] 

3 a path r passing 



< E E 

•MMU=Lctj}i>o,...,i>Lc tJ 



Lctj-i 



^ E E 

{«:|M|=o=lctJ}«o,"-,«|ctj 



successively through the Vi, T(vi,v i+ i) < t 

-0 

=tj-i 

^2 T(vi,v. l+1 ) < t 



i=0 



(15) 



where the second sum runs throughout over all Vi with vq = 0, 
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= 1, and T(x, y) represents 
the infection passage time from node x to node y. Letting 
T'(vi,v i+ i) be random variables identically distributed as 
T(vi, f i+i) but independent for i — 1, . . . , \ ct\ — 1, we can 
write, for -0 > 0, 



E i 

JJ ,...,1/LetJ 



LrfJ-1 

^2 T(vi,v i+ i) < t 



E 

vo,— ,*>[c*J 



LctJ-1 



i=0 

Lctj-i 



< e n e 

LctJ-1 



e n e 



-ipT (vi ,Vi+i ) 




Lctj 



-i/iT'(0,u) 



{«:|MU = 1} 



In the last step of the above display, we have successively 
summed over ui ct j , v \ct\ -i, ■ ■ • , vo, and have used the fact 
that infection spread times are translation-invariant, i.e. for 

any x,y,a £ Z 2 , 

T '(%, V) = T i x , y) = T{% + a,y + a) 
= T'{x + a,y + a). 



For any u G Z 2 such that it is a neighbour of (i.e. ||u||oo = 
1), we must have T(Q,u) > mm w: n w n—-it((0,w)), where 
t{e) ~ Exp(/x) is the travel time of the infection across edge 
e E E. Since the number of neighbours of in G is exactly 
8 (4 up-down/left-right and 4 diagonal), T(0, u) stochastically 
dominates an exponential random variable with parameter 8fi. 
Thus, 



E 




< E 

















(where f - Exp(8^)) 




(16) 



Setting ijj = 8jLt(8e - 1) so that 8(1 + i/V/i) -1 = e~\ ( |16| 
becomes 



E E 

.{«:||u||oo=l} 



-i/>T'(0,u) 



Finally, letting ci 



< e 8/i(8e-l)t . e -c*+l_ 

Be — 1) and c = ci/i + C2, we obtain 



the desired result from ([15) and the above: 



E E 

{v:|MI=o = LctJ}«o,-,«Lctj 



Lctj-i 

^ r(ui,« i+1 ) < i 



<|{,;:||H| 00 = M}|-e- C2t+1 

< (4ct) • e- C2t+1 

= 0((c lA/ + c 2 )t-e- C2t ). 

■ 

Using Lemma [3] we can finally state our converse result for 
the geometric random graph: 

Theorem (Theorem |7J Lower bound for RGGs). For the pla- 
nar geometric random graph G n with r n — 0(y // logn/n) and 
any spreading policy ir with L max (n) = (9(n 1_<: ) for some 



e e (0, 1], 3 jS > s.t. lim„ 



T w >/3- 



log 4 / 3 ) 



Proof sketch: The method of approach is along the lines 
of that used to prove Theorem [5] along with certain geometric 
considerations for the case of the random geometric graph. We 
introduce a spreading process that spreads 'faster' than ir, and 
show using Lemma [3] that even this process must take at least 
the claimed amount of time to spread. For ease of exposition, 
we break the proof into two steps: 

Step 1: Divide the unit square [0, 1] x [0, 1] row and 
column-wise into r n x r n tiles; there are thus 1/r 2 tiles, say 
fci, . . . , k\ / r z . By standard balls-and-bins arguments, with the 
n nodes thrown randomly into n/ log n tiles, each tile receives 
a maximum of O(logn) nodes with probability 1 — o(l). 

Step 2: Within the event in step 1, we introduce the 
following associated spreading process which, via coupling 
arguments, can be shown to dominate the spread due to it at 
each time t: first, take each tile to be the vertex of a square 
grid where adjacent diagonals are connected. Also, set the 
rate of infection spread on every edge to be Exp(fi log 2 n). 
This effectively upper-bounds the best rate of spread among 
neighbouring tiles. Create a dominating process by creating 
non-interfering clusters at a Poisson rate 1, with each cluster 
growing independently on an infinite square grid with diagonal 
edges and the above spread rate. Lemma [3] shows that w.h.p., 
by time t, 0(t) clusters are formed, and each cluster has at 
most 0(t 2 log 4 n) nodes. Thus it takes at least O ( log 4/3 
time for spreading to finish w.h.p. 

VI. Conclusion 



We have modelled and analyzed the spread of epidemic pro- 
cesses on graphs when assisted by external agents. For general 
graphs, we have provided upper bounds on the spreading time 
due to external-infection with bounded virulence for random 
and greedy infection policies; these bounds are in terms of 
the diameter and the conductance of the graph. On the other 
hand, for certain spatially-constrained graphs such as grids and 
the geometric random graph, we have derived corresponding 
lower bounds: these indicate that random external-infection 
spreading is order-optimal up to logarithmic factors (and 
greedy is order-optimal) in such scenarios. Finally, we have 
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discussed applications of our result to graphs with long-range 
edges and/or mobile agents. 
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